OCR exploration server


A hierarchical approach to recognition of handwritten Bangla characters

Internal identifier: 000A99 (Main/Exploration); previous: 000A98; next: 000B00

Authors: Subhadip Basu [India]; Nibaran Das [India]; Ram Sarkar [India]; Mahantapas Kundu [India]; Mita Nasipuri [India]; Dipak Kumar Basu [India]

Source: Pattern recognition (ISSN 0031-3203), 2009

RBID : Pascal:09-0201963

Abstract

A novel hierarchical approach is presented here for optical character recognition (OCR) of handwritten Bangla words. Instead of dealing with isolated characters as found in selected works [T.K. Bhowmik, U. Bhattacharya, S.K. Parui, Recognition of Bangla handwritten characters using an MLP classifier based on stroke features, in: Proceedings of the ICONIP, Kolkata, India, 2004, pp. 814-819; K. Roy, U. Pal, F. Kimura, Bangla handwritten character recognition, in: Proceedings of the Second Indian International Conference on Artificial Intelligence (IICAI), 2005, pp. 431-443; S. Basu, N. Das, R. Sarkar, M. Kundu, M. Nasipuri, D.K. Basu, Handwritten Bangla alphabet recognition using an MLP based classifier, in: Proceedings of the Second National Conference on Computer Processing of Bangla, Dhaka, 2005, pp. 285-291; A.F.R. Rahman, R. Rahman, M.C. Fairhurst, Recognition of handwritten Bengali characters: a novel multistage approach, Pattern Recognition 35, 2002, pp. 997-1006; U. Bhattacharya, S.K. Parui, M. Sridhar, F. Kimura, Two-stage recognition of handwritten Bangla alphanumeric characters using neural classifiers, in: Proceedings of the Second Indian International Conference on Artificial Intelligence (IICAI), 2005, pp. 1357-1376; U. Bhattacharya, M. Sridhar, S.K. Parui, On recognition of handwritten Bangla characters, in: Proceedings of the ICVGIP-06, Lecture Notes in Computer Science, vol. 4338, 2006, pp. 817-828], the present approach segments a word image on Matra hierarchy, then recognizes the individual word segments and finally identifies the constituent characters of the word image through intelligent combination of recognition decisions of the associated word segments. Due to possible appearances of consecutive characters of Bangla words on overlapping character positions, segmentation of Bangla word images is not easy. For successful OCR of handwritten Bangla text, not only recognition but also segmentation of word images are important. 
In this respect the present hierarchical approach deals with both segmentation and recognition of handwritten Bangla word images for a complete solution to handwritten word recognition problem, an essential area of OCR of handwritten Bangla text. In dealing with certain category of word segments, created on Matra hierarchy, a sophisticated recognition technique, viz., two-pass approach [S. Basu, C. Chaudhury, M. Kundu, M. Nasipuri, D.K. Basu, A two pass approach to pattern classification, in: N.R. Pal et al. (Ed.), Lecture Notes in Computer Science, vol. 3316, ICONIP, Kolkata, 2004, pp. 781-786] is employed here. The degree of sophistication of the classification technique is also rationally tuned depending on various categories of word segments to be recognized. For example, the two-pass approach is employed here for recognizing middle zone character segments, whereas recognition of middle zone modified shapes of Bangla script is done through simple template matching. Considering learning and generalization abilities of multilayer perceptrons (MLPs), MLP based pattern classifiers are used here for most of the classification related tasks. A powerful feature set is also designed under this work for recognition of complex character patterns using three types of topological features, viz., longest-run features, modified shadow features and octant-centroid features. In a nutshell, the work deals with a practical problem of OCR of Bangla text involving recognition as well as segmentation of constituent characters of handwritten Bangla words.
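The abstract names longest-run features without defining them. The following is a minimal sketch of one common reading, not the authors' implementation: for a binary glyph image, take the longest run of consecutive foreground pixels in each row (and each column), sum those lengths, and normalise by the image area. The function names, the toy glyph, and the area normalisation are assumptions made for illustration; the paper may also use diagonal runs or compute the features over sub-regions.

```python
def longest_run(line):
    """Length of the longest run of consecutive foreground (truthy) pixels."""
    best = current = 0
    for pixel in line:
        current = current + 1 if pixel else 0
        best = max(best, current)
    return best


def longest_run_features(image):
    """Return (row-wise, column-wise) longest-run sums, normalised by area.

    `image` is a list of equal-length rows of 0/1 values.
    The normalisation by rows * cols is an assumption of this sketch.
    """
    rows = len(image)
    cols = len(image[0])
    row_sum = sum(longest_run(row) for row in image)
    col_sum = sum(longest_run(col) for col in zip(*image))
    area = rows * cols
    return row_sum / area, col_sum / area


# Hypothetical 4 x 5 binary glyph used only to exercise the functions.
glyph = [
    [0, 1, 1, 1, 0],
    [0, 1, 0, 0, 0],
    [0, 1, 1, 0, 0],
    [0, 1, 0, 0, 0],
]
row_feature, col_feature = longest_run_features(glyph)
print(row_feature, col_feature)
```

In a pipeline of the kind the abstract describes, values like these would be concatenated with the other feature types into the input vector of an MLP classifier.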


The document in XML format

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">A hierarchical approach to recognition of handwritten Bangla characters</title>
<author>
<name sortKey="Basu, Subhadip" sort="Basu, Subhadip" uniqKey="Basu S" first="Subhadip" last="Basu">Subhadip Basu</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Computer Science and Engineering Department, Jadavpur University</s1>
<s2>Kolkata 700032</s2>
<s3>IND</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
<country>Inde</country>
<wicri:noRegion>Kolkata 700032</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Das, Nibaran" sort="Das, Nibaran" uniqKey="Das N" first="Nibaran" last="Das">Nibaran Das</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Computer Science and Engineering Department, Jadavpur University</s1>
<s2>Kolkata 700032</s2>
<s3>IND</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
<country>Inde</country>
<wicri:noRegion>Kolkata 700032</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Sarkar, Ram" sort="Sarkar, Ram" uniqKey="Sarkar R" first="Ram" last="Sarkar">Ram Sarkar</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Computer Science and Engineering Department, Jadavpur University</s1>
<s2>Kolkata 700032</s2>
<s3>IND</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
<country>Inde</country>
<wicri:noRegion>Kolkata 700032</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Kundu, Mahantapas" sort="Kundu, Mahantapas" uniqKey="Kundu M" first="Mahantapas" last="Kundu">Mahantapas Kundu</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Computer Science and Engineering Department, Jadavpur University</s1>
<s2>Kolkata 700032</s2>
<s3>IND</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
<country>Inde</country>
<wicri:noRegion>Kolkata 700032</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Nasipuri, Mita" sort="Nasipuri, Mita" uniqKey="Nasipuri M" first="Mita" last="Nasipuri">Mita Nasipuri</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Computer Science and Engineering Department, Jadavpur University</s1>
<s2>Kolkata 700032</s2>
<s3>IND</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
<country>Inde</country>
<wicri:noRegion>Kolkata 700032</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Dipak Kumar Basu" sort="Dipak Kumar Basu" uniqKey="Dipak Kumar Basu" last="Dipak Kumar Basu">DIPAK KUMAR BASU</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Computer Science and Engineering Department, Jadavpur University</s1>
<s2>Kolkata 700032</s2>
<s3>IND</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
<country>Inde</country>
<wicri:noRegion>Kolkata 700032</wicri:noRegion>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">09-0201963</idno>
<date when="2009">2009</date>
<idno type="stanalyst">PASCAL 09-0201963 INIST</idno>
<idno type="RBID">Pascal:09-0201963</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000229</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000550</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000216</idno>
<idno type="wicri:doubleKey">0031-3203:2009:Basu S:a:hierarchical:approach</idno>
<idno type="wicri:Area/Main/Merge">000B10</idno>
<idno type="wicri:Area/Main/Curation">000A99</idno>
<idno type="wicri:Area/Main/Exploration">000A99</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a">A hierarchical approach to recognition of handwritten Bangla characters</title>
<author>
<name sortKey="Basu, Subhadip" sort="Basu, Subhadip" uniqKey="Basu S" first="Subhadip" last="Basu">Subhadip Basu</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Computer Science and Engineering Department, Jadavpur University</s1>
<s2>Kolkata 700032</s2>
<s3>IND</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
<country>Inde</country>
<wicri:noRegion>Kolkata 700032</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Das, Nibaran" sort="Das, Nibaran" uniqKey="Das N" first="Nibaran" last="Das">Nibaran Das</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Computer Science and Engineering Department, Jadavpur University</s1>
<s2>Kolkata 700032</s2>
<s3>IND</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
<country>Inde</country>
<wicri:noRegion>Kolkata 700032</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Sarkar, Ram" sort="Sarkar, Ram" uniqKey="Sarkar R" first="Ram" last="Sarkar">Ram Sarkar</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Computer Science and Engineering Department, Jadavpur University</s1>
<s2>Kolkata 700032</s2>
<s3>IND</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
<country>Inde</country>
<wicri:noRegion>Kolkata 700032</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Kundu, Mahantapas" sort="Kundu, Mahantapas" uniqKey="Kundu M" first="Mahantapas" last="Kundu">Mahantapas Kundu</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Computer Science and Engineering Department, Jadavpur University</s1>
<s2>Kolkata 700032</s2>
<s3>IND</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
<country>Inde</country>
<wicri:noRegion>Kolkata 700032</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Nasipuri, Mita" sort="Nasipuri, Mita" uniqKey="Nasipuri M" first="Mita" last="Nasipuri">Mita Nasipuri</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Computer Science and Engineering Department, Jadavpur University</s1>
<s2>Kolkata 700032</s2>
<s3>IND</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
<country>Inde</country>
<wicri:noRegion>Kolkata 700032</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Dipak Kumar Basu" sort="Dipak Kumar Basu" uniqKey="Dipak Kumar Basu" last="Dipak Kumar Basu">DIPAK KUMAR BASU</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Computer Science and Engineering Department, Jadavpur University</s1>
<s2>Kolkata 700032</s2>
<s3>IND</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
<sZ>4 aut.</sZ>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
<country>Inde</country>
<wicri:noRegion>Kolkata 700032</wicri:noRegion>
</affiliation>
</author>
</analytic>
<series>
<title level="j" type="main">Pattern recognition</title>
<title level="j" type="abbreviated">Pattern recogn.</title>
<idno type="ISSN">0031-3203</idno>
<imprint>
<date when="2009">2009</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<title level="j" type="main">Pattern recognition</title>
<title level="j" type="abbreviated">Pattern recogn.</title>
<idno type="ISSN">0031-3203</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Artificial intelligence</term>
<term>Automatic classification</term>
<term>Character recognition</term>
<term>Handwritten character recognition</term>
<term>Hierarchical classification</term>
<term>Hierarchized structure</term>
<term>Image processing</term>
<term>Image segmentation</term>
<term>India</term>
<term>Learning</term>
<term>Manuscript character</term>
<term>Multilayer perceptrons</term>
<term>Multistage method</term>
<term>Neural network</term>
<term>Optical character recognition</term>
<term>Pattern classification</term>
<term>Pattern matching</term>
<term>Pattern recognition</term>
<term>Shadow</term>
<term>Signal classification</term>
<term>Speech processing</term>
<term>Speech recognition</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Caractère manuscrit</term>
<term>Reconnaissance optique caractère</term>
<term>Réseau neuronal</term>
<term>Classification automatique</term>
<term>Inde</term>
<term>Reconnaissance caractère manuscrit</term>
<term>Intelligence artificielle</term>
<term>Reconnaissance caractère</term>
<term>Méthode section divisée</term>
<term>Reconnaissance forme</term>
<term>Segmentation image</term>
<term>Structure hiérarchisée</term>
<term>Reconnaissance parole</term>
<term>Classification forme</term>
<term>Concordance forme</term>
<term>Apprentissage</term>
<term>Perceptron multicouche</term>
<term>Ombre</term>
<term>Classification hiérarchique</term>
<term>Classification signal</term>
<term>Traitement image</term>
<term>Traitement parole</term>
</keywords>
<keywords scheme="Wicri" type="geographic" xml:lang="fr">
<term>Inde</term>
</keywords>
<keywords scheme="Wicri" type="topic" xml:lang="fr">
<term>Intelligence artificielle</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">A novel hierarchical approach is presented here for optical character recognition (OCR) of handwritten Bangla words. Instead of dealing with isolated characters as found in selected works [T.K. Bhowmik, U. Bhattacharya, S.K. Parui, Recognition of Bangla handwritten characters using an MLP classifier based on stroke features, in: Proceedings of the ICONIP, Kolkata, India, 2004, pp. 814-819; K. Roy, U. Pal, F. Kimura, Bangla handwritten character recognition, in: Proceedings of the Second Indian International Conference on Artificial Intelligence (IICAI), 2005, pp. 431-443; S. Basu, N. Das, R. Sarkar, M. Kundu, M. Nasipuri, D.K. Basu, Handwritten Bangla alphabet recognition using an MLP based classifier, in: Proceedings of the Second National Conference on Computer Processing of Bangla, Dhaka, 2005, pp. 285-291; A.F.R. Rahman, R. Rahman, M.C. Fairhurst, Recognition of handwritten Bengali characters: a novel multistage approach, Pattern Recognition 35, 2002, pp. 997-1006; U. Bhattacharya, S.K. Parui, M. Sridhar, F. Kimura, Two-stage recognition of handwritten Bangla alphanumeric characters using neural classifiers, in: Proceedings of the Second Indian International Conference on Artificial Intelligence (IICAI), 2005, pp. 1357-1376; U. Bhattacharya, M. Sridhar, S.K. Parui, On recognition of handwritten Bangla characters, in: Proceedings of the ICVGIP-06, Lecture Notes in Computer Science, vol. 4338, 2006, pp. 817-828], the present approach segments a word image on Matra hierarchy, then recognizes the individual word segments and finally identifies the constituent characters of the word image through intelligent combination of recognition decisions of the associated word segments. Due to possible appearances of consecutive characters of Bangla words on overlapping character positions, segmentation of Bangla word images is not easy. 
For successful OCR of handwritten Bangla text, not only recognition but also segmentation of word images are important. In this respect the present hierarchical approach deals with both segmentation and recognition of handwritten Bangla word images for a complete solution to handwritten word recognition problem, an essential area of OCR of handwritten Bangla text. In dealing with certain category of word segments, created on Matra hierarchy, a sophisticated recognition technique, viz., two-pass approach [S. Basu, C. Chaudhury, M. Kundu, M. Nasipuri, D.K. Basu, A two pass approach to pattern classification, in: N.R. Pal et al. (Ed.), Lecture Notes in Computer Science, vol. 3316, ICONIP, Kolkata, 2004, pp. 781-786] is employed here. The degree of sophistication of the classification technique is also rationally tuned depending on various categories of word segments to be recognized. For example, the two-pass approach is employed here for recognizing middle zone character segments, whereas recognition of middle zone modified shapes of Bangla script is done through simple template matching. Considering learning and generalization abilities of multilayer perceptrons (MLPs), MLP based pattern classifiers are used here for most of the classification related tasks. A powerful feature set is also designed under this work for recognition of complex character patterns using three types of topological features, viz., longest-run features, modified shadow features and octant-centroid features. In a nutshell, the work deals with a practical problem of OCR of Bangla text involving recognition as well as segmentation of constituent characters of handwritten Bangla words.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Inde</li>
</country>
</list>
<tree>
<country name="Inde">
<noRegion>
<name sortKey="Basu, Subhadip" sort="Basu, Subhadip" uniqKey="Basu S" first="Subhadip" last="Basu">Subhadip Basu</name>
</noRegion>
<name sortKey="Das, Nibaran" sort="Das, Nibaran" uniqKey="Das N" first="Nibaran" last="Das">Nibaran Das</name>
<name sortKey="Dipak Kumar Basu" sort="Dipak Kumar Basu" uniqKey="Dipak Kumar Basu" last="Dipak Kumar Basu">DIPAK KUMAR BASU</name>
<name sortKey="Kundu, Mahantapas" sort="Kundu, Mahantapas" uniqKey="Kundu M" first="Mahantapas" last="Kundu">Mahantapas Kundu</name>
<name sortKey="Nasipuri, Mita" sort="Nasipuri, Mita" uniqKey="Nasipuri M" first="Mita" last="Nasipuri">Mita Nasipuri</name>
<name sortKey="Sarkar, Ram" sort="Sarkar, Ram" uniqKey="Sarkar R" first="Ram" last="Sarkar">Ram Sarkar</name>
</country>
</tree>
</affiliations>
</record>
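
The record above uses the `wicri:` and `inist:` prefixes without declaring them, so a strict XML parser rejects it as written. A minimal sketch of one way to read it with Python's standard library follows. It wraps the record in a root element that binds both prefixes to placeholder URIs. The URIs, the wrapper element name, the function names, and the shortened sample record are all assumptions for illustration, not part of Dilib or Wicri.

```python
import xml.etree.ElementTree as ET

# Placeholder namespace URIs (hypothetical): they only make the prefixes legal.
WRAPPER_OPEN = (
    '<root xmlns:wicri="urn:example:wicri" '
    'xmlns:inist="urn:example:inist">'
)
WRAPPER_CLOSE = "</root>"

# Shortened excerpt of the record, kept small so the sketch is self-contained.
SAMPLE = """<record>
<TEI>
<teiHeader>
<fileDesc>
<publicationStmt>
<date when="2009">2009</date>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a">A hierarchical approach to recognition of handwritten Bangla characters</title>
<author>
<name sortKey="Basu, Subhadip">Subhadip Basu</name>
<affiliation wicri:level="1">
<country>Inde</country>
</affiliation>
</author>
<author>
<name sortKey="Das, Nibaran">Nibaran Das</name>
</author>
</analytic>
</biblStruct>
</sourceDesc>
</fileDesc>
</teiHeader>
</TEI>
</record>"""


def parse_record(record_xml):
    """Parse one <record> after binding the undeclared prefixes."""
    root = ET.fromstring(WRAPPER_OPEN + record_xml + WRAPPER_CLOSE)
    return root.find("record")


def summarize(record):
    """Extract title, author names and year from the analytic description."""
    analytic = record.find(".//analytic")
    return {
        "title": analytic.findtext("title"),
        "authors": [name.text for name in analytic.findall("author/name")],
        "year": record.find(".//publicationStmt/date").get("when"),
    }


info = summarize(parse_record(SAMPLE))
print(info)
```

Reading names from `analytic` rather than from the whole record avoids counting each author twice, since the record repeats the author list under `titleStmt`.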

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000A99 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000A99 | SxmlIndent | more

To add a link to this page in the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     Pascal:09-0201963
   |texte=   A hierarchical approach to recognition of handwritten Bangla characters
}}

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024